Gaussian Kernel Width Generator for Support Vector Clustering

نویسندگان

  • SEI-HYUNG LEE
  • KAREN DANIELS
چکیده

Clustering data into natural groupings has important applications in fields such as Bioinformatics. Support Vector Clustering (SVC) does not require prior knowledge of a dataset and it can identify irregularly shaped cluster boundaries. A major SVC challenge is the choice of an important parameter value, the width of a kernel function that determines a nonlinear transformation of the input data. Since evaluating the result of a clustering algorithm is a highly subjective process, a collection of different parameter values must typically be examined. However, no algorithm has been proposed to specify the parameter values. This paper presents a secant-like numerical algorithm that generates an increasing sequence of SVC kernel width values. An estimate of sequence length depends on spatial characteristics of the data but not the number of data points or the data’s dimensionality. The algorithm relies on a function that relates the kernel width value to the radius of the minimal sphere enclosing the images of data points in a high-dimensional feature space. Experimental results with 2D and higher-dimensional datasets suggest that the algorithm yields useful data clusterings.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Simple estimate of the width in Gaussian kernel with adaptive scaling technique

This paper presents a simple estimate to determine the width of Gaussian kernel with the adaptive scaling technique. The Gaussian kernel is widely employed in Radial Basis Function (RBF) network, Support Vector Machine (SVM), Least Squares Support Vector Machine (LS-SVM), Kriging, and so on. It is widely known that the width of the Gaussian kernel in these machine learning techniques plays an i...

متن کامل

kernlab – An S4 Package for Kernel Methods in R

kernlab is an extensible package for kernel-based machine learning methods in R. It takes advantage of R’s new S4 object model and provides a framework for creating and using kernelbased algorithms. The package contains dot product primitives (kernels), implementations of support vector machines and the relevance vector machine, Gaussian processes, a ranking algorithm, kernel PCA, kernel CCA, k...

متن کامل

Clustering via Hilbert space

We discuss novel clustering methods that are based on mapping data points to a Hilbert space by means of a Gaussian kernel. The *rst method, support vector clustering (SVC), searches for the smallest sphere enclosing data images in Hilbert space. The second, quantum clustering (QC), searches for the minima of a potential function de*ned in such a Hilbert space. In SVC, the minimal sphere, when ...

متن کامل

Support Vector Clustering

We present a novel clustering method using the approach of support vector machines. Data points are mapped by means of a Gaussian kernel to a high dimensional feature space, where we search for the minimal enclosing sphere. This sphere, when mapped back to data space, can separate into several components, each enclosing a separate cluster of points. We present a simple algorithm for identifying...

متن کامل

Asymptotic Behaviors of Support Vector Machines with Gaussian Kernel

Support vector machines (SVMs) with the gaussian (RBF) kernel have been popular for practical use. Model selection in this class of SVMs involves two hyperparameters: the penalty parameter C and the kernel width sigma. This letter analyzes the behavior of the SVM classifier when these hyperparameters take very small or very large values. Our results help in understanding the hyperparameter spac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004